Accelerating Falcon on ARMv8
نویسندگان
چکیده
Falcon is one of the promising digital-signature algorithms in NIST’s ongoing Post-Quantum Cryptography (PQC) standardization finalist. Computational efficiency regarding software and hardware also main criteria for PQC standardization. In this paper, we present an efficient implementation on ARMv8 environment. Until now, most optimization have been conducted 32-bit ARM (Cortex-M4) typical CPUs (Intel AMD CPUs). However, including Cortex-A30, 50, 70 series widely used various IoT (Internet Things) applications, Edge computing devices, OBUs (On Board Units) autonomous driving cars. For optimizing performance Falcon, take full advantage NEON engine which a kind parallel processing unit MCU. The computation belongs to polynomial multiplications Complex number domain Integer domain. Typically, FFT (Fast Fourier Transformation)-based multiplication method NTT (Number Theoriteic Transform)-based domain, respectively. Thus, order enhance overall improve FFT-based NTT-based by utilizing ARMv8. Specifically, parallelize process (FFT/NTT transformation, pointwise multiplication, inverse FFT/NTT transformation) with strategically vector instructions. Furthermore, minimize redundant memory accesses during FFT/NTT-based making available registers engine. Through proposed multiplications, provides 15.1% (resp. 18.1%), 16.5% 17.1%), 65.4% 69.4%) improvement keypair generation, signing, verification at security level 1 5) compared reference submitted final round NIST competition. as far know, first optimized
منابع مشابه
Implementing GCM on ARMv8
The Galois/Counter Mode is an authenticated encryption scheme which is included in protocols such as TLS and IPSec. Its implementation requires multiplication over a binary finite field, an operation which is costly to implement in software. Recent processors have included instructions aimed to speed up binary polynomial multiplication, an operation which can be used to implement binary field m...
متن کاملBinary field multiplication on ARMv8
In this paper, we show efficient implementations of binary field multiplication over ARMv8. We exploit an advanced 64-bit polynomial multiplication (PMULL) supported by ARMv8 and conduct multiple levels of asymptotically faster Karatsuba multiplication. Finally, our method conducts binary field multiplication within 57 clock cycles for B-251. Our proposed method on ARMv8 improves the performanc...
متن کاملPromising Compilation to ARMv8 POP
We prove the correctness of compilation of relaxed memory accesses and release-acquire fences from the “promising” semantics of Kang et al. [12] to the ARMv8 POP machine of Flur et al. [9]. The proof is highly non-trivial because both the ARMv8 POP and the promising semantics provide some extremely weak consistency guarantees for normal memory accesses; however, they do so in rather different w...
متن کاملFalcon City: An Integration of Novint Falcon in Virtual Reality
Haptic Devices enable various tactile feedback and allow users to interact with a virtual environment in a more immersive way. The current Virtual Reality (VR) interface in the market is largely comprised of devices designed for visual and auditory feedback, with Head Mounted Display (HMD) devices such as Oculus Rift leading the industry by providing significant visual immersion. However, there...
متن کاملFalcon Temperature Regulation
--We measured tarsal and body temperatures of four species of large falcons in relation to rising ambient temperature and found that the tarsus has an apparent hermoregulatory function. Statistical efforts to separate the birds into ecological or plumage types yielded mixed results. An index of tarsal surface area per unit body weight was correlated with the temperature regimens of the birds, w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2022
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2022.3169784